comparison matrix
An experimental approach: The graph of graphs
Szádoczki, Zsombor, Bozóki, Sándor, Sipos, László, Galambosi, Zsófia
One of the essential issues in decision problems and preference modeling is the number of comparisons and their pattern to ask from the decision maker. We focus on the optimal patterns of pairwise comparisons and the sequence including the most (close to) optimal cases based on the results of a color selection experiment. In the test, six colors (red, green, blue, magenta, turquoise, yellow) were evaluated with pairwise comparisons as well as in a direct manner, on color-calibrated tablets in ISO standardized sensory test booths of a sensory laboratory. All the possible patterns of comparisons resulting in a connected representing graph were evaluated against the complete data based on 301 individual's pairwise comparison matrices (PCMs) using the logarithmic least squares weight calculation technique. It is shown that the empirical results, i.e., the empirical distributions of the elements of PCMs, are quite similar to the former simulated outcomes from the literature. The obtained empirically optimal patterns of comparisons were the best or the second best in the former simulations as well, while the sequence of comparisons that contains the most (close to) optimal patterns is exactly the same. In order to enhance the applicability of the results, besides the presentation of graph of graphs, and the representing graphs of the patterns that describe the proposed sequence of comparisons themselves, the recommendations are also detailed in a table format as well as in a Java application.
- Europe > Hungary > Budapest > Budapest (0.05)
- Asia > Japan (0.04)
- North America > United States > New York (0.04)
ConSim: Measuring Concept-Based Explanations' Effectiveness with Automated Simulatability
Poché, Antonin, Jacovi, Alon, Picard, Agustin Martin, Boutin, Victor, Jourdan, Fanny
Concept-based explanations work by mapping complex model computations to human-understandable concepts. Evaluating such explanations is very difficult, as it includes not only the quality of the induced space of possible concepts but also how effectively the chosen concepts are communicated to users. Existing evaluation metrics often focus solely on the former, neglecting the latter. We introduce an evaluation framework for measuring concept explanations via automated simulatability: a simulator's ability to predict the explained model's outputs based on the provided explanations. This approach accounts for both the concept space and its interpretation in an end-to-end evaluation. Human studies for simulatability are notoriously difficult to enact, particularly at the scale of a wide, comprehensive empirical evaluation (which is the subject of this work). We propose using large language models (LLMs) as simulators to approximate the evaluation and report various analyses to make such approximations reliable. Our method allows for scalable and consistent evaluation across various models and datasets. We report a comprehensive empirical evaluation using this framework and show that LLMs provide consistent rankings of explanation methods. Code available at https://github.com/AnonymousConSim/ConSim.
- Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > Michigan (0.04)
Classifying Mental-Disorders through Clinicians Subjective Approach based on Three-way Decision
Wang, Huidong, Sourav, Md Sakib Ullah, Yang, Mengdi, Zhang, Jiaping
Prevalence can be seen as having a lack of motivation to live, losing interest in everything among common people. Hence, they are frequently thriving towards psychiatric diagnosis than in the past days. Therefore, improper diagnosis of mental health disorders may lead to even more vulnerable consequences in a greater sense from an individual to a social perspective [38]. The traditional form of psychiatric diagnosis is much pretentious nowadays as few recent studies demonstrate several shortcomings within the widely established systems used for classifying mental disorders, namely, bipolar disorder, anxiety disorders, phobias, substance use disorder, mood disorders, and many others [2,3]. More often these recognized tools, such as DSM-5 [7] and ICD-11 [8], fails to distinguish between the proper and correct disorder diagnosis of a complex phenomenon in individual cases.
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > Maryland > Montgomery County > Rockville (0.04)
- Asia > China > Shanghai > Shanghai (0.04)
- (3 more...)
Prayatul Matrix: A Direct Comparison Approach to Evaluate Performance of Supervised Machine Learning Models
Performance comparison of supervised machine learning (ML) models are widely done in terms of different confusion matrix based scores obtained on test datasets. However, a dataset comprises several instances having different difficulty levels. Therefore, it is more logical to compare effectiveness of ML models on individual instances instead of comparing scores obtained for the entire dataset. In this paper, an alternative approach is proposed for direct comparison of supervised ML models in terms of individual instances within the dataset. A direct comparison matrix called \emph{Prayatul Matrix} is introduced, which accounts for comparative outcome of two ML algorithms on different instances of a dataset. Five different performance measures are designed based on prayatul matrix. Efficacy of the proposed approach as well as designed measures is analyzed with four classification techniques on three datasets. Also analyzed on four large-scale complex image datasets with four deep learning models namely ResNet50V2, MobileNetV2, EfficientNet, and XceptionNet. Results are evident that the newly designed measure are capable of giving more insight about the comparing ML algorithms, which were impossible with existing confusion matrix based scores like accuracy, precision and recall.
- North America > United States > California (0.04)
- Europe > United Kingdom > England > Greater London > London (0.04)
- Europe > Germany (0.04)
- (2 more...)
- Government (0.46)
- Information Technology (0.46)
Heuristic Rating Estimation Method for the incomplete pairwise comparisons matrices
Kułakowski, Konrad, Kędzior, Anna
The Heuristic Rating Estimation Method enables decision-makers to decide based on existing ranking data and expert comparisons. In this approach, the ranking values of selected alternatives are known in advance, while these values have to be calculated for the remaining ones. Their calculation can be performed using either an additive or a multiplicative method. Both methods assumed that the pairwise comparison sets involved in the computation were complete. In this paper, we show how these algorithms can be extended so that the experts do not need to compare all alternatives pairwise. Thanks to the shortening of the work of experts, the presented, improved methods will reduce the costs of the decision-making procedure and facilitate and shorten the stage of collecting decision-making data.
- North America > United States > Michigan (0.04)
- Europe > Poland (0.04)
- Asia > Japan > Honshū > Kantō > Saitama Prefecture > Saitama (0.04)
- (3 more...)
Extension of Saaty's inconsistency index to incomplete comparisons: Approximated thresholds
Ágoston, Kolos Csaba, Csató, László
Pairwise comparison matrices are increasingly used in settings where some pairs are missing. However, there exist few inconsistency indices to analyse such incomplete data sets and even fewer measures have an associated threshold. This paper generalises the inconsistency index proposed by Saaty to incomplete pairwise comparison matrices. The extension is based on the approach of filling the missing elements to minimise the eigenvalue of the incomplete matrix. It means that the well-established values of the random index, a crucial component of the consistency ratio for which the famous threshold of 0.1 provides the condition for the acceptable level of inconsistency, cannot be directly adopted. The inconsistency of random matrices turns out to be the function of matrix size and the number of missing elements, with a nearly linear dependence in the case of the latter variable. Our results can be directly used by practitioners as a statistical criterion for accepting/rejecting an incomplete pairwise comparison matrix.
- Europe > Hungary > Budapest > Budapest (0.05)
- North America > United States > New York (0.04)
- Asia > Japan (0.04)
Entropy production rate as a criterion for inconsistency in decision theory
Individual and group decisions are complex, often involving choosing an apt alternative from a multitude of options. Evaluating pairwise comparisons breaks down such complex decision problems into tractable ones. Pairwise comparison matrices (PCMs) are regularly used to solve multiple-criteria decision-making (MCDM) problems, for example, using Saaty's analytic hierarchy process (AHP) framework. However, there are two significant drawbacks of using PCMs. First, humans evaluate PCMs in an inconsistent manner. Second, not all entries of a large PCM can be reliably filled by human decision makers. We address these two issues by first establishing a novel connection between PCMs and time-irreversible Markov processes. Specifically, we show that every PCM induces a family of dissipative maximum path entropy random walks (MERW) over the set of alternatives. We show that only `consistent' PCMs correspond to detailed balanced MERWs. We identify the non-equilibrium entropy production in the induced MERWs as a metric of inconsistency of the underlying PCMs. Notably, the entropy production satisfies all of the recently laid out criteria for reasonable consistency indices. We also propose an approach to use incompletely filled PCMs in AHP. Potential future avenues are discussed as well. keywords: analytic hierarchy process, markov chains, maximum entropy
An application of incomplete pairwise comparison matrices for ranking top tennis players
Bozóki, Sándor, Csató, László, Temesi, József
Pairwise comparison is an important tool in multi-attribute decision making. Pairwise comparison matrices (PCM) have been applied for ranking criteria and for scoring alternatives according to a given criterion. Our paper presents a special application of incomplete PCMs: ranking of professional tennis players based on their results against each other. The selected 25 players have been on the top of the ATP rankings for a shorter or longer period in the last 40 years. Some of them have never met on the court. One of the aims of the paper is to provide ranking of the selected players, however, the analysis of incomplete pairwise comparison matrices is also in the focus. The eigenvector method and the logarithmic least squares method were used to calculate weights from incomplete PCMs. In our results the top three players of four decades were Nadal, Federer and Sampras. Some questions have been raised on the properties of incomplete PCMs and remains open for further investigation.
- Europe > Hungary > Budapest > Budapest (0.05)
- North America > United States > New York (0.04)
- Asia > Japan (0.04)
Evaluation and selection of Medical Tourism sites: A rough AHP based MABAC approach
Roy, Jagannath, Chatterjee, Kajal, Bandhopadhyay, Abhirup, Kar, Samarjit
High costs of treatment, long waiting time, affordability of airfares to overseas destinations and favorable exchange rate change are crucial factors related to the fast growth of Medical Tourism (Connell, 2006). Rapid development of medical infrastructure with international standards and certification, easy availability of skilled manpower bring South Asian countries like Thailand, Malaysia, and India at the forefront in this area. With current annual growth of 13.0 percent, the Indian health care sector contributes about $ 23 billion (nearly 4 percent of GDP) to the Indian economy, with'foreign exchange earning around $1.8 billion' (Chakraborty, 2006). Although research studies are abundant focusing on social impacts of Medical Tourism, there is no proper methodology for customers, both foreign and domestic, to assess the medical tourist destination in any country. The problem can be solved by taking the interest of stakeholder's in assessing the weights of a multiple criteria set, namely medical infrastructure, logistics service providers, 1 government policy along with city demography. Therefore, assessment of desirable medical destination selection and evaluation problem can be considered decision making problem with multiple attributes varying from consumer demands to resource constraints of medical related industry. In this regard, MCDM has become a very crucial area of management research and decision theory with lots of methods developed, extended and modified in solving problems in the present and past few decades.
- Asia > Thailand (0.24)
- Asia > Malaysia (0.24)
- Asia > India > Tamil Nadu > Chennai (0.06)
- (7 more...)
- Consumer Products & Services > Travel (1.00)
- Banking & Finance (1.00)
Rank Aggregation via Low-Rank and Structured-Sparse Decomposition
Pan, Yan (Sun Yat-sen University) | Lai, Hanjiang (Sun Yat-sen University) | Liu, Cong (Sun Yat-sen University) | Tang, Yong (South Normal University of China) | Yan, Shuicheng (National University of Singapore)
Rank aggregation, which combines multiple individual rank lists toobtain a better one, is a fundamental technique in various applications such as meta-search and recommendation systems. Most existing rank aggregation methods blindly combine multiple rank lists with possibly considerable noises, which often degrades their performances. In this paper, we propose a new model for robust rank aggregation (RRA) via matrix learning, which recovers a latent rank list from the possibly incomplete and noisy input rank lists. In our model, we construct a pairwise comparison matrix to encode the order information in each input rank list. Based on our observations, each comparison matrix can be naturally decomposed into a shared low-rank matrix, combined with a deviation error matrix which is the sum of a column-sparse matrix and a row-sparse one. The latent rank list can be easily extracted from the learned low-rank matrix. The optimization formulation of RRA has an element-wise multiplication operator to handle missing values, a symmetric constraint on the noise structure, and a factorization trick to restrict the maximum rank of the low-rank matrix. To solve this challenging optimization problem, we propose a novel procedure based on the Augmented Lagrangian Multiplier scheme. We conduct extensive experiments on meta-search and collaborative filtering benchmark datasets. The results show that the proposed RRA has superior performance gain over several state-of-the-art algorithms for rank aggregation.
- Asia > China > Guangdong Province > Guangzhou (0.04)
- North America > United States > New York (0.04)
- Asia > Singapore > Central Region > Singapore (0.04)
- (2 more...)